Abstract 

Clustering plays an important role in data mining, pattern recognition, and machine learning. 
Single valued neutrosophic sets (SVNSs) are a useful means to describe and handle 
indeterminate and inconsistent information, which fuzzy sets and intuitionistic fuzzy sets cannot 
describe and deal with. To cluster data represented by single valued neutrosophic information, this 
paper proposes a single valued neutrosophic minimum spanning tree (SVNMST) clustering 
algorithm. First, we define a generalized distance measure between SVNSs. Then, we present an 
SVNMST clustering algorithm for clustering single valued neutrosophic data based on the generalized 
distance measure of SVNSs. Finally, two illustrative examples are given to demonstrate the 
application and effectiveness of the developed approach. 

1. Introduction 

Clustering plays an important role in data mining, pattern recognition, and machine learning. 
Clustering data sets into disjoint groups is a problem arising in many domains. Generally, the goal of 
clustering is to find groups that are both homogeneous and well separated, that is, entities within the 
same group should be similar and entities in different groups dissimilar. Data sets can be 
represented by weighted graphs, where nodes correspond to the entities to be clustered and edges 
correspond to a dissimilarity or similarity measure between those entities. Graph theory [1] provides 
a very convenient tool to describe clustering problems. In particular, a minimum spanning tree 
(MST) is a very useful graph structure and can capture perceptual grouping [2]. Zahn [3] defined 
several criteria of edge inconsistency for detecting clusters of different shapes and proposed a 
clustering algorithm using the MST. Xu et al. [4] introduced three MST algorithms and applied them to 
clustering gene expression data. Due to the fuzziness and uncertainty of many practical problems in 
the real world, Ruspini [5] first presented the concept of fuzzy division and a fuzzy clustering 
approach. Dong et al. [6] introduced a hierarchical clustering algorithm based on fuzzy graph 
connectedness. Then, Chen et al. [7] put forward a maximal tree clustering method for fuzzy graphs: 
it constructs the fuzzy similarity relation matrix, uses a threshold of this matrix to cut the maximum 
spanning tree, and thereby obtains the classification at each level. 
Zhao et al. [8] proposed two intuitionistic fuzzy MST (IFMST) clustering algorithms to deal with 
intuitionistic fuzzy information and extended them to clustering interval-valued intuitionistic fuzzy 
information. Furthermore, Zhang and Xu [9] introduced an MST algorithm-based clustering method 
under a hesitant fuzzy environment. 

To represent uncertain, imprecise, incomplete, and inconsistent information which exists in the 
real world, Smarandache [10] gave the concept of a neutrosophic set from a philosophical point of 
view. The neutrosophic set [10] is a powerful general formal framework which generalizes the 
concepts of the classic set, fuzzy set, interval valued fuzzy set, intuitionistic fuzzy set, interval valued 
intuitionistic fuzzy set, paraconsistent set, dialetheist set, paradoxist set, and tautological set. In the 
neutrosophic set, truth-membership, indeterminacy-membership, and falsity-membership are 
represented independently. However, the neutrosophic set generalizes the above mentioned sets from a 
philosophical point of view, and its functions $T_A(x)$, $I_A(x)$ and $F_A(x)$ are real standard or nonstandard 
subsets of $]^{-}0, 1^{+}[$, i.e., $T_A(x): X \to ]^{-}0, 1^{+}[$, $I_A(x): X \to ]^{-}0, 1^{+}[$, and $F_A(x): X \to ]^{-}0, 1^{+}[$. So it is 
difficult to apply in real scientific and engineering areas. Thus, Wang et al. [11] introduced the single 
valued neutrosophic set (SVNS), which is an instance of a neutrosophic set. It can describe and 
handle indeterminate information and inconsistent information, which fuzzy sets and intuitionistic 
fuzzy sets (IFSs) cannot describe and deal with. Recently, Ye [12, 13] presented the correlation 
coefficient of SVNSs and the cross-entropy measure of SVNSs and applied them to single valued 
neutrosophic decision-making problems. Yet, until now there has been no study on clustering 
data represented by single valued neutrosophic information, and the existing MST clustering 
algorithms cannot cluster single valued neutrosophic data. Therefore, this paper proposes a single 
valued neutrosophic MST (SVNMST) clustering algorithm to deal with data represented by 
SVNSs. The rest of the article is organized as follows. Section 2 introduces some basic 
concepts of SVNSs, the graph and its MST. Section 3 defines a generalized distance measure 
between SVNSs. In Section 4, a single valued neutrosophic clustering algorithm is proposed based 
on the MST. In Section 5, two illustrative examples are given to demonstrate the applications and the 
effectiveness of the proposed approach. Conclusions and further research are contained in Section 6. 

2. Preliminaries 

In this section, some basic concepts of SVNSs, the graph and its MST are introduced to be 
utilized in the next sections. 

2.1. Some concepts of SVNSs 

Smarandache [10] introduced the concept of a neutrosophic set from a philosophical point of 
view. 

Definition 1 [10]. Let X be a space of points (objects), with a generic element in X denoted by x. A 
neutrosophic set A in X is characterized by a truth-membership function $T_A(x)$, an 
indeterminacy-membership function $I_A(x)$, and a falsity-membership function $F_A(x)$. The functions 
$T_A(x)$, $I_A(x)$ and $F_A(x)$ are real standard or nonstandard subsets of $]^{-}0, 1^{+}[$. That is, $T_A(x): X \to ]^{-}0, 1^{+}[$, 
$I_A(x): X \to ]^{-}0, 1^{+}[$, and $F_A(x): X \to ]^{-}0, 1^{+}[$, with the condition 
$^{-}0 \le \sup T_A(x) + \sup I_A(x) + \sup F_A(x) \le 3^{+}$. 

Obviously, the neutrosophic set is difficult to apply in real scientific and engineering fields. Thus, Wang et al. 
[11] introduced the concept of an SVNS, which is an instance of a neutrosophic set. 

Definition 2 [11]. Let X be a space of points (objects) with generic elements in X denoted by x. An 
SVNS A in X is characterized by a truth-membership function $T_A(x)$, an indeterminacy-membership 
function $I_A(x)$, and a falsity-membership function $F_A(x)$. For each point x in X, there are $T_A(x), I_A(x), 
F_A(x) \in [0, 1]$ and $0 \le T_A(x) + I_A(x) + F_A(x) \le 3$. Therefore, an SVNS A can be represented by 

$$A = \{\langle x, T_A(x), I_A(x), F_A(x)\rangle \mid x \in X\}.$$

The following expressions are defined in [11] for SVNSs A, B: 

(1) $A \subseteq B$ if and only if $T_A(x) \le T_B(x)$, $I_A(x) \ge I_B(x)$, $F_A(x) \ge F_B(x)$ for any x in X, 


http://mc.manuscriptcentral.com/jisys 






Journal of Intelligent Systems 

(2) $A = B$ if and only if $A \subseteq B$ and $B \subseteq A$, 

(3) $A^c = \{\langle x, F_A(x), 1 - I_A(x), T_A(x)\rangle \mid x \in X\}$. 
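
The three expressions above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the dictionary representation (element name mapped to a (T, I, F) triple), the function names, and the sample values are hypothetical and do not come from [11]; both sets are assumed to share the same elements.

```python
from typing import Dict, Tuple

# Hypothetical representation: element name -> (T, I, F), each value in [0, 1].
SVNS = Dict[str, Tuple[float, float, float]]


def is_subset(a: SVNS, b: SVNS) -> bool:
    """Expression (1): T_A <= T_B, I_A >= I_B and F_A >= F_B for every x."""
    return all(
        a[x][0] <= b[x][0] and a[x][1] >= b[x][1] and a[x][2] >= b[x][2]
        for x in a
    )


def is_equal(a: SVNS, b: SVNS) -> bool:
    """Expression (2): mutual containment."""
    return is_subset(a, b) and is_subset(b, a)


def complement(a: SVNS) -> SVNS:
    """Expression (3): swap T and F, replace I by 1 - I."""
    return {x: (f, 1.0 - i, t) for x, (t, i, f) in a.items()}


# Illustrative values only; they are not taken from the paper.
A = {"x1": (0.3, 0.2, 0.5), "x2": (0.6, 0.3, 0.1)}
B = {"x1": (0.4, 0.1, 0.4), "x2": (0.7, 0.3, 0.0)}

print(is_subset(A, B), is_subset(B, A), is_equal(A, A))
print(complement(A)["x1"])
```

Note that containment requires the indeterminacy and falsity degrees to move in the opposite direction to the truth degree, which is why `is_subset(B, A)` fails for these values.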

2.2. Graph and minimum spanning trees 

A graph G is a pair of sets G = (V, E), where V is the set of vertices (or nodes) and E is the set 
of edges. There are two kinds of graphs: undirected graphs and directed graphs. Each edge in an 
undirected graph is an unordered pair $\{v_i, v_j\}$, while each edge in a directed graph is an ordered 
pair $(v_i, v_j)$, where the vertices $v_i$ and $v_j$ are called the endpoints of the edge. A sequence of edges and 
vertices that can be traveled between two different vertices is called a path. For example, a graph 
with 5 nodes and 7 edges is shown in Fig. 1. 







Fig. 1 Graph 

In Fig. 1, there are different paths from the node B to the node E, such as the path (BAE) and 
the path (BCDE). A path whose start node and destination node are the same is called a 
circuit, like (ABCA) or (ADEA). 

A connected graph has paths between any pair of nodes. A connected acyclic graph that 
contains all nodes of G is called a spanning tree of the graph, which is any set of straight line 
segments connecting pairs of nodes such that: 

(1) no closed loops occur; 

(2) each node is visited by at least one line; 

(3) a tree is connected. 

For example, a spanning tree with integer segment lengths is shown in Fig. 2, which is 
obtained from Fig. 1. If the nodes B and C or D and E were connected, a closed loop would be 
formed and the resulting figure would not be a tree. We define the length of a tree to be the 
sum of the lengths of its constituent edges, so the length of the tree in Fig. 2 is 4 + 6 + 5 + 8 = 23 units. 







Fig. 2 Spanning tree 







Fig. 3 Minimum spanning tree 

An MST is a spanning tree of minimum length, which is often what is required. For instance, Fig. 
3 shows the MST obtained from Fig. 1, and its minimum length is 4 + 3 + 2 + 5 = 14 units. 

Various algorithms to find the MST have been proposed in [4, 14, 15]. The two most popular 
algorithms, which are needed later, find the MST of a graph G by operating iteratively [14, 15]. 
At any stage, each segment belongs to one of two sets, i.e., the set A containing those 
segments assigned to the MST and the set B containing those not assigned. 

The algorithm given by Kruskal [14] iteratively assigns to the set A the shortest segment 
in the set B which does not form a closed loop with any of the segments already in A. Initially A is 
empty, and the iteration stops when A contains $(n - 1)$ segments. 

The algorithm given by Prim [15] starts with any one of the given nodes and initially assigns to 
A the shortest segment starting from this node. The procedure continues to add the shortest segment 
from B which connects to at least one segment from A without forming a closed loop amongst the 
segments already in A. The iteration stops when A contains $(n - 1)$ segments. 
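
The two procedures can be sketched as follows. The edge set is the one implied by the paths and circuits named for Fig. 1 (AB, AC, AD, AE, BC, CD, DE); the edge weights are assumptions, since the figure itself is not reproduced here, and were chosen only so that the totals agree with the lengths 23 and 14 quoted in the text. The function names are hypothetical.

```python
import heapq

# Edge set inferred from the paths and circuits named for Fig. 1.
# The weights are ASSUMED (the figure is unavailable); they were picked so
# that the star tree has length 23 and the MST has length 14.
NODES = ["A", "B", "C", "D", "E"]
EDGES = [("A", "B", 4), ("A", "C", 6), ("A", "D", 5), ("A", "E", 8),
         ("B", "C", 3), ("C", "D", 2), ("D", "E", 5)]


def kruskal(nodes, edges):
    """Add the shortest remaining edge that does not close a loop."""
    parent = {v: v for v in nodes}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    tree = []
    for u, v, w in sorted(edges, key=lambda e: e[2]):
        ru, rv = find(u), find(v)
        if ru != rv:                      # endpoints not yet connected
            parent[ru] = rv
            tree.append((u, v, w))
        if len(tree) == len(nodes) - 1:   # stop at n - 1 segments
            break
    return tree


def prim(nodes, edges, start):
    """Grow one tree from `start`, always taking the shortest outgoing edge."""
    adj = {v: [] for v in nodes}
    for u, v, w in edges:
        adj[u].append((w, u, v))
        adj[v].append((w, v, u))
    visited = {start}
    heap = list(adj[start])
    heapq.heapify(heap)
    tree = []
    while heap and len(tree) < len(nodes) - 1:
        w, u, v = heapq.heappop(heap)
        if v in visited:                  # would form a closed loop
            continue
        visited.add(v)
        tree.append((u, v, w))
        for edge in adj[v]:
            if edge[2] not in visited:
                heapq.heappush(heap, edge)
    return tree


kruskal_tree = kruskal(NODES, EDGES)
prim_tree = prim(NODES, EDGES, "A")
kruskal_length = sum(w for _, _, w in kruskal_tree)
prim_length = sum(w for _, _, w in prim_tree)
star_length = sum(w for u, _, w in EDGES if u == "A")
print(kruskal_length, prim_length, star_length)
```

Both procedures return a tree with $n - 1 = 4$ segments; with tied weights they may in general select different edges, but the total length is the same.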

Usually, clustering data sets can be represented as weighted graphs, where nodes correspond 
to the entities to be clustered and edges correspond to a distance measure (also called a dissimilarity 
measure) between those entities. If a fuzzy relation R over $V \times V$ is defined, then the membership 
function $\mu_R(v_1, v_2)$, where $(v_1, v_2) \in V \times V$, takes various values from 0 to 1, and such a graph is 
called a fuzzy graph. When R is an intuitionistic fuzzy relation over $V \times V$, such a graph is called 
an intuitionistic fuzzy graph [8]. 

3. Distance measures of SVNSs 

For two SVNSs A and B in a universe of discourse $X = \{x_1, x_2, \ldots, x_n\}$, which are denoted by 
$A = \{\langle x_i, T_A(x_i), I_A(x_i), F_A(x_i)\rangle \mid x_i \in X\}$ and $B = \{\langle x_i, T_B(x_i), I_B(x_i), F_B(x_i)\rangle \mid x_i \in X\}$, where 
$T_A(x_i), I_A(x_i), F_A(x_i), T_B(x_i), I_B(x_i), F_B(x_i) \in [0, 1]$ for every $x_i \in X$, let us consider the weight $w_i$ 
(i = 1, 2, ..., n) of an element $x_i$ (i = 1, 2, ..., n), with $w_i \ge 0$ (i = 1, 2, ..., n) and $\sum_{i=1}^{n} w_i = 1$. 
Then, we define the generalized single valued neutrosophic weighted distance: 

$$d_\lambda(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^{\lambda} + |I_A(x_i) - I_B(x_i)|^{\lambda} + |F_A(x_i) - F_B(x_i)|^{\lambda} \right] \right\}^{1/\lambda}, \tag{1}$$

where $\lambda > 0$. 

In particular, if $\lambda = 1, 2$, Eq. (1) reduces to the single valued neutrosophic weighted Hamming 
distance and the single valued neutrosophic weighted Euclidean distance, respectively, as follows: 

$$d_1(A, B) = \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)| + |I_A(x_i) - I_B(x_i)| + |F_A(x_i) - F_B(x_i)| \right], \tag{2}$$

$$d_2(A, B) = \left\{ \frac{1}{3} \sum_{i=1}^{n} w_i \left[ |T_A(x_i) - T_B(x_i)|^{2} + |I_A(x_i) - I_B(x_i)|^{2} + |F_A(x_i) - F_B(x_i)|^{2} \right] \right\}^{1/2}. \tag{3}$$
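
As a minimal sketch of Eq. (1), assuming an SVNS is stored as a list of (T, I, F) triples in attribute order, the distance can be computed as below. The function name and the sample values are hypothetical and are not taken from the examples in Section 5.

```python
from typing import Sequence, Tuple

Triple = Tuple[float, float, float]  # (T, I, F) for one element x_i


def svns_distance(a: Sequence[Triple], b: Sequence[Triple],
                  w: Sequence[float], lam: float = 2.0) -> float:
    """Generalized weighted distance of Eq. (1); lam must be positive."""
    if lam <= 0:
        raise ValueError("lambda must be > 0")
    total = sum(
        wi * sum(abs(p - q) ** lam for p, q in zip(ai, bi))
        for wi, ai, bi in zip(w, a, b)
    )
    return (total / 3.0) ** (1.0 / lam)


# Illustrative values only (two attributes, equal weights).
A = [(0.5, 0.2, 0.3), (0.8, 0.1, 0.1)]
B = [(0.3, 0.4, 0.3), (0.6, 0.1, 0.3)]
W = (0.5, 0.5)

hamming = svns_distance(A, B, W, lam=1.0)    # Eq. (2)
euclidean = svns_distance(A, B, W, lam=2.0)  # Eq. (3)
print(round(hamming, 4), round(euclidean, 4))
```

For these values the bracketed sum is 0.4 for $\lambda = 1$ and 0.08 for $\lambda = 2$, so the two distances are $0.4/3$ and $\sqrt{0.08/3}$.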



Then, for the distance measure we have the following proposition. 



Proposition 1. The above defined distance $d_\lambda(A, B)$ for $\lambda > 0$ satisfies the following properties: 

(P1) $d_\lambda(A, B) \ge 0$; 

(P2) $d_\lambda(A, B) = 0$ if and only if $A = B$; 

(P3) $d_\lambda(A, B) = d_\lambda(B, A)$; 

(P4) If $A \subseteq B \subseteq C$, where C is an SVNS in X, then $d_\lambda(A, C) \ge d_\lambda(A, B)$ and $d_\lambda(A, C) \ge d_\lambda(B, C)$. 

Proof. It is easy to see that $d_\lambda(A, B)$ satisfies the properties (P1)-(P3). Therefore, we only prove (P4). 
Let $A \subseteq B \subseteq C$, then $T_A(x_i) \le T_B(x_i) \le T_C(x_i)$, $I_A(x_i) \ge I_B(x_i) \ge I_C(x_i)$, and $F_A(x_i) \ge F_B(x_i) \ge F_C(x_i)$ for 
every $x_i \in X$. Then, we obtain the following relations: 

$$|T_A(x_i) - T_B(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda}, \quad |T_B(x_i) - T_C(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda},$$

$$|I_A(x_i) - I_B(x_i)|^{\lambda} \le |I_A(x_i) - I_C(x_i)|^{\lambda}, \quad |I_B(x_i) - I_C(x_i)|^{\lambda} \le |I_A(x_i) - I_C(x_i)|^{\lambda},$$

$$|F_A(x_i) - F_B(x_i)|^{\lambda} \le |F_A(x_i) - F_C(x_i)|^{\lambda}, \quad |F_B(x_i) - F_C(x_i)|^{\lambda} \le |F_A(x_i) - F_C(x_i)|^{\lambda}.$$

Hence 

$$|T_A(x_i) - T_B(x_i)|^{\lambda} + |I_A(x_i) - I_B(x_i)|^{\lambda} + |F_A(x_i) - F_B(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda} + |I_A(x_i) - I_C(x_i)|^{\lambda} + |F_A(x_i) - F_C(x_i)|^{\lambda},$$

$$|T_B(x_i) - T_C(x_i)|^{\lambda} + |I_B(x_i) - I_C(x_i)|^{\lambda} + |F_B(x_i) - F_C(x_i)|^{\lambda} \le |T_A(x_i) - T_C(x_i)|^{\lambda} + |I_A(x_i) - I_C(x_i)|^{\lambda} + |F_A(x_i) - F_C(x_i)|^{\lambda}.$$

Combining the above inequalities with the distance formula (1), we obtain 

$$d_\lambda(A, B) \le d_\lambda(A, C) \quad \text{and} \quad d_\lambda(B, C) \le d_\lambda(A, C) \quad \text{for } \lambda > 0.$$

Thus the property (P4) is obtained. □ 
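
The properties can also be spot-checked numerically. The sketch below is not a proof; it evaluates Eq. (1) on one hypothetical nested triple $A \subseteq B \subseteq C$ (values and weights are illustrative assumptions) for several values of $\lambda$, and it checks only the "if" direction of (P2).

```python
def d(a, b, w, lam):
    """Eq. (1) for SVNSs stored as lists of (T, I, F) triples."""
    s = sum(wi * sum(abs(p - q) ** lam for p, q in zip(ai, bi))
            for wi, ai, bi in zip(w, a, b))
    return (s / 3.0) ** (1.0 / lam)


# Hypothetical sets with A ⊆ B ⊆ C: T increases, I and F decrease.
W = (0.4, 0.6)
A = [(0.2, 0.6, 0.7), (0.3, 0.5, 0.6)]
B = [(0.4, 0.5, 0.5), (0.5, 0.4, 0.4)]
C = [(0.7, 0.2, 0.1), (0.9, 0.1, 0.2)]

results = {}
for lam in (0.5, 1.0, 2.0, 3.0):
    d_ab, d_bc, d_ac = d(A, B, W, lam), d(B, C, W, lam), d(A, C, W, lam)
    results[lam] = (
        d_ab >= 0 and d(A, A, W, lam) == 0.0,  # (P1) and d(A, A) = 0
        d_ab == d(B, A, W, lam),               # (P3)
        d_ac >= d_ab and d_ac >= d_bc,         # (P4)
    )

for lam, checks in results.items():
    print(lam, checks)
```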

In the following, we give the definition of the single valued neutrosophic distance matrix. 

Definition 3. Let $A_j$ (j = 1, 2, ..., m) be a collection of m SVNSs, then $D = (d_{ij})_{m \times m}$ is called a single 
valued neutrosophic distance matrix, where $d_{ij} = d_\lambda(A_i, A_j)$ is the distance between $A_i$ and $A_j$, and its 
properties are as follows: 

(1) $0 \le d_{ij} \le 1$ for all i, j = 1, 2, ..., m; 

(2) $d_{ij} = 0$ if and only if $A_i = A_j$; 

(3) $d_{ij} = d_{ji}$ for all i, j = 1, 2, ..., m. 
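
A distance matrix in the sense of Definition 3 can be assembled as sketched below. The helper names and the three sample SVNSs are hypothetical; only the upper triangle is computed and then mirrored, which is what property (3) permits.

```python
def svns_distance(a, b, w, lam=2.0):
    """Eq. (1) for SVNSs stored as lists of (T, I, F) triples."""
    s = sum(wi * sum(abs(p - q) ** lam for p, q in zip(ai, bi))
            for wi, ai, bi in zip(w, a, b))
    return (s / 3.0) ** (1.0 / lam)


def distance_matrix(samples, w, lam=2.0):
    """Symmetric m x m matrix D = (d_ij) with a zero diagonal."""
    m = len(samples)
    D = [[0.0] * m for _ in range(m)]
    for i in range(m):
        for j in range(i + 1, m):
            D[i][j] = D[j][i] = svns_distance(samples[i], samples[j], w, lam)
    return D


# Illustrative samples only (three SVNSs over two attributes).
SAMPLES = [
    [(0.8, 0.1, 0.1), (0.7, 0.2, 0.1)],
    [(0.7, 0.1, 0.2), (0.7, 0.2, 0.2)],
    [(0.2, 0.6, 0.7), (0.1, 0.5, 0.8)],
]
D = distance_matrix(SAMPLES, (0.5, 0.5))
for row in D:
    print(" ".join(f"{v:.4f}" for v in row))
```

Property (1) holds because every absolute difference lies in [0, 1] and the weights sum to 1, so the averaged sum inside the braces of Eq. (1) cannot exceed 1.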

4. SVNMST clustering algorithm 

In this section, an SVNMST clustering algorithm is proposed as a generalization of the IFMST 
clustering algorithm [8]. 

Let $X = \{x_1, x_2, \ldots, x_n\}$ be an attribute space and the weight vector of the elements $x_i$ (i = 1, 
2, ..., n) be $w = \{w_1, w_2, \ldots, w_n\}$ with $w_i \ge 0$ (i = 1, 2, ..., n) and $\sum_{i=1}^{n} w_i = 1$. Assume that $A_j$ (j = 1, 
2, ..., m) is a collection of m SVNSs, which express m samples to be clustered. Then they have the 
following form: 

$$A_j = \{\langle x_i, T_{A_j}(x_i), I_{A_j}(x_i), F_{A_j}(x_i)\rangle \mid x_i \in X\}.$$

Thus, we propose an SVNMST clustering algorithm, which is described by the following steps: 

Step 1. Calculate the distances $d_{ij} = d_\lambda(A_i, A_j)$ (i, j = 1, 2, ..., m) by Eq. (1) to establish the 
single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$. 

Step 2. Draw the single valued neutrosophic graph G(V, E), where every edge between $A_i$ and 
$A_j$ is assigned the weight (single valued neutrosophic distance) $d_{ij}$ coming from an element of the 
single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the dissimilarity degree 
between the samples $A_i$ and $A_j$. 

Step 3. Build the MST of the single valued neutrosophic graph G(V, E) by Kruskal's method 
[14] or Prim's method [15]: 

(1) Sort the edges of G in increasing order by weights. 

(2) Keep a sub-graph S of G, which is initially empty, and at each step add to S the edge e with 
the smallest weight whose endpoints are not yet connected in S. 

(3) Repeat the process (2) until the sub-graph S spans all vertices. Hence, the MST of the 
single valued neutrosophic graph G(V, E) is obtained. 

Step 4. Perform clustering by use of the SVNMST. We can get a certain number of sub-trees 
(clusters) by disconnecting all the edges of the MST with weights greater than a threshold r. The 
clustering results induced by the sub-trees do not depend on the particular MST chosen [8, 9]. 
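
Steps 1 to 4 can be put together as in the following sketch. It is one possible reading of the procedure, not the author's implementation: it uses Prim's method on the dense distance matrix for Step 3, and all function names, sample values, and thresholds are hypothetical.

```python
def svns_distance(a, b, w, lam=2.0):
    """Step 1 building block: Eq. (1) on lists of (T, I, F) triples."""
    s = sum(wi * sum(abs(p - q) ** lam for p, q in zip(ai, bi))
            for wi, ai, bi in zip(w, a, b))
    return (s / 3.0) ** (1.0 / lam)


def prim_mst(D):
    """Step 3: Prim's method on a dense matrix; returns (i, j, weight) edges."""
    m = len(D)
    in_tree = [False] * m
    best = [float("inf")] * m   # cheapest known link into the tree
    link = [-1] * m             # tree node providing that link
    best[0] = 0.0
    edges = []
    for _ in range(m):
        u = min((i for i in range(m) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if link[u] >= 0:
            edges.append((link[u], u, D[link[u]][u]))
        for v in range(m):
            if not in_tree[v] and D[u][v] < best[v]:
                best[v], link[v] = D[u][v], u
    return edges


def cut(m, edges, r):
    """Step 4: drop MST edges heavier than r and return the sub-trees."""
    parent = list(range(m))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j, weight in edges:
        if weight <= r:
            parent[find(i)] = find(j)
    groups = {}
    for i in range(m):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def svnmst_clusters(samples, w, r, lam=2.0):
    m = len(samples)
    # Steps 1 and 2: the matrix D doubles as the complete weighted graph.
    D = [[svns_distance(a, b, w, lam) for b in samples] for a in samples]
    return cut(m, prim_mst(D), r)


# Illustrative data: samples 0 and 1 are close, samples 2 and 3 are close.
SAMPLES = [
    [(0.8, 0.1, 0.1), (0.7, 0.2, 0.1)],
    [(0.7, 0.1, 0.2), (0.7, 0.2, 0.2)],
    [(0.2, 0.6, 0.7), (0.1, 0.5, 0.8)],
    [(0.2, 0.5, 0.7), (0.2, 0.5, 0.8)],
]
W = (0.5, 0.5)
tight = svnmst_clusters(SAMPLES, W, r=0.1)
loose = svnmst_clusters(SAMPLES, W, r=0.5)
print(tight, loose)
```

Lowering r can only split clusters further, so sweeping r over the MST edge weights yields a nested sequence of partitions.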

5. Illustrative examples 

In this section, two illustrative examples are presented to demonstrate the real applications and 
the effectiveness of the proposed approach. 

Example 1. A car market is going to classify eight different cars $A_j$ (j = 1, 2, ..., 8). Every car has 
six evaluation factors (attributes): (1) $x_1$: fuel consumption; (2) $x_2$: coefficient of friction; (3) $x_3$: price; 
(4) $x_4$: comfortable degree; (5) $x_5$: design; (6) $x_6$: security coefficient. The characteristics of each car 
under the six attributes are represented in the form of SVNSs, and the single valued 
neutrosophic data are as follows: 

$A_1 = \{\langle x_1, 0.3, 0.2, 0.5\rangle, \langle x_2, 0.6, 0.3, 0.1\rangle, \langle x_3, 0.4, 0.3, 0.3\rangle, \langle x_4, 0.8, 0.1, 0.1\rangle, \langle x_5, 0.1, 0.3, 0.6\rangle, \langle x_6, 0.5, 0.2, 0.4\rangle\}$, 

$A_2 = \{\langle x_1, 0.6, 0.3, 0.3\rangle, \langle x_2, 0.5, 0.4, 0.2\rangle, \langle x_3, 0.6, 0.2, 0.1\rangle, \langle x_4, 0.7, 0.2, 0.1\rangle, \langle x_5, 0.3, 0.1, 0.6\rangle, \langle x_6, 0.4, 0.3, 0.3\rangle\}$, 

$A_3 = \{\langle x_1, 0.4, 0.2, 0.4\rangle, \langle x_2, 0.8, 0.2, 0.1\rangle, \langle x_3, 0.5, 0.3, 0.1\rangle, \langle x_4, 0.6, 0.1, 0.2\rangle, \langle x_5, 0.4, 0.1, 0.5\rangle, \langle x_6, 0.3, 0.2, 0.2\rangle\}$, 

$A_4 = \{\langle x_1, 0.2, 0.4, 0.4\rangle, \langle x_2, 0.4, 0.5, 0.1\rangle, \langle x_3, 0.9, 0.2, 0.0\rangle, \langle x_4, 0.8, 0.2, 0.1\rangle, \langle x_5, 0.2, 0.3, 0.5\rangle, \langle x_6, 0.7, 0.3, 0.1\rangle\}$, 

$A_5 = \{\langle x_1, 0.2, 0.3, 0.3\rangle, \langle x_2, 0.3, 0.2, 0.6\rangle, \langle x_3, 0.5, 0.1, 0.4\rangle, \langle x_4, 0.7, 0.1, 0.1\rangle, \langle x_5, 0.4, 0.2, 0.4\rangle, \langle x_6, 0.3, 0.2, 0.6\rangle\}$, 

$A_6 = \{\langle x_1, 0.3, 0.2, 0.4\rangle, \langle x_2, 0.2, 0.1, 0.7\rangle, \langle x_3, 0.4, 0.2, 0.5\rangle, \langle x_4, 0.8, 0.0, 0.1\rangle, \langle x_5, 0.4, 0.3, 0.5\rangle, \langle x_6, 0.2, 0.1, 0.7\rangle\}$, 

$A_7 = \{\langle x_1, 0.4, 0.4, 0.3\rangle, \langle x_2, 0.5, 0.3, 0.1\rangle, \langle x_3, 0.6, 0.1, 0.2\rangle, \langle x_4, 0.2, 0.3, 0.7\rangle, \langle x_5, 0.3, 0.1, 0.5\rangle, \langle x_6, 0.7, 0.2, 0.1\rangle\}$, 

$A_8 = \{\langle x_1, 0.4, 0.1, 0.2\rangle, \langle x_2, 0.6, 0.1, 0.1\rangle, \langle x_3, 0.8, 0.2, 0.1\rangle, \langle x_4, 0.7, 0.2, 0.1\rangle, \langle x_5, 0.1, 0.1, 0.8\rangle, \langle x_6, 0.2, 0.1, 0.8\rangle\}$. 

The weight vector of the attributes $x_i$ (i = 1, 2, ..., 6) is $w = (0.16, 0.12, 0.25, 0.2, 0.15, 0.12)^T$. 
Then we utilize the SVNMST clustering algorithm to group the eight different cars $A_j$ (j = 1, 2, ..., 8): 

Step 1. Calculate the distance $d_{ij} = d_\lambda(A_i, A_j)$ by Eq. (1) (take $\lambda = 2$). Then we can establish the 
single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$ as follows: 

$$D = \begin{pmatrix}
0.0000 & 0.2737 & 0.2551 & 0.3583 & 0.4022 & 0.4559 & 0.5052 & 0.4100 \\
0.2737 & 0.0000 & 0.2269 & 0.2605 & 0.4064 & 0.5334 & 0.4271 & 0.4117 \\
0.2551 & 0.2269 & 0.0000 & 0.3311 & 0.4577 & 0.5527 & 0.3983 & 0.4450 \\
0.3583 & 0.2605 & 0.3311 & 0.0000 & 0.5442 & 0.6720 & 0.4559 & 0.5794 \\
0.4022 & 0.4064 & 0.4577 & 0.5442 & 0.0000 & 0.1987 & 0.5957 & 0.4811 \\
0.4559 & 0.5334 & 0.5527 & 0.6720 & 0.1987 & 0.0000 & 0.7155 & 0.5248 \\
0.5052 & 0.4271 & 0.3983 & 0.4559 & 0.5957 & 0.7155 & 0.0000 & 0.6370 \\
0.4100 & 0.4117 & 0.4450 & 0.5794 & 0.4811 & 0.5248 & 0.6370 & 0.0000
\end{pmatrix}$$




Step 2. Draw the single valued neutrosophic graph G(V, E), where every edge between $A_i$ and 
$A_j$ (i, j = 1, 2, ..., 8) is assigned the weight (single valued neutrosophic distance) $d_{ij}$ coming from an 
element of the single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the 
dissimilarity degree between the samples $A_i$ and $A_j$. Then, the single valued neutrosophic graph 
G(V, E) is shown in Fig. 4. 




Fig. 4 Single valued neutrosophic graph G(V, E) with the eight nodes 

Step 3. Establish the MST of the single valued neutrosophic graph G(V, E) by Kruskal's 
method [14] or Prim's method [15]: 

(1) Sort the edges of G in increasing order by weights: 

$d_{56} < d_{23} < d_{13} < d_{24} < d_{12} < d_{34} < d_{14} < d_{37} < d_{15} < d_{25} < d_{18} < d_{28} < d_{27} < d_{38} < d_{16} = d_{47} < d_{35} 
< d_{58} < d_{17} < d_{68} < d_{26} < d_{45} < d_{36} < d_{48} < d_{57} < d_{78} < d_{46} < d_{67}$. 

(2) Keep an empty sub-graph S of G and add the edge e with the smallest weight to S, where 
the endpoints of e are disconnected; thus we choose the edge $e_{56}$ between $A_5$ and $A_6$. 

(3) Repeat the process (2) until the sub-graph S spans eight nodes. Thus, the MST of the single 
valued neutrosophic graph G(V, E) is obtained, as shown in Fig. 5. 

Fig. 5 The MST of the single valued neutrosophic graph G(V, E) with the eight nodes 

Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than 
r to obtain a certain number of sub-trees (clusters), as listed in Table 1. 



Table 1. SVNMST clustering results of the eight different cars 

r                          Corresponding clustering result 
$r = d_{18} = 0.41$        $\{A_1, A_2, A_3, A_4, A_5, A_6, A_7, A_8\}$ 
$r = d_{15} = 0.4022$      $\{A_1, A_2, A_3, A_4, A_5, A_6, A_7\}, \{A_8\}$ 
$r = d_{37} = 0.3983$      $\{A_1, A_2, A_3, A_4, A_7\}, \{A_5, A_6\}, \{A_8\}$ 
$r = d_{24} = 0.2605$      $\{A_1, A_2, A_3, A_4\}, \{A_5, A_6\}, \{A_7\}, \{A_8\}$ 
$r = d_{13} = 0.2551$      $\{A_1, A_2, A_3\}, \{A_4\}, \{A_5, A_6\}, \{A_7\}, \{A_8\}$ 
$r = d_{23} = 0.2269$      $\{A_1\}, \{A_2, A_3\}, \{A_4\}, \{A_5, A_6\}, \{A_7\}, \{A_8\}$ 
$r = d_{56} = 0.1987$      $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5, A_6\}, \{A_7\}, \{A_8\}$ 
$r = 0$                    $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}, \{A_7\}, \{A_8\}$ 
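
Steps 3 and 4 of Example 1 can be retraced from the printed distance matrix alone. The sketch below takes the matrix of Step 1 as given (it does not recompute it from the car data), runs Kruskal's method, and cuts the resulting tree at a chosen threshold; the function names are hypothetical, and cars are numbered 1 to 8 as in the text.

```python
# Distance matrix copied from Step 1 of Example 1 (not recomputed here).
D = [
    [0.0000, 0.2737, 0.2551, 0.3583, 0.4022, 0.4559, 0.5052, 0.4100],
    [0.2737, 0.0000, 0.2269, 0.2605, 0.4064, 0.5334, 0.4271, 0.4117],
    [0.2551, 0.2269, 0.0000, 0.3311, 0.4577, 0.5527, 0.3983, 0.4450],
    [0.3583, 0.2605, 0.3311, 0.0000, 0.5442, 0.6720, 0.4559, 0.5794],
    [0.4022, 0.4064, 0.4577, 0.5442, 0.0000, 0.1987, 0.5957, 0.4811],
    [0.4559, 0.5334, 0.5527, 0.6720, 0.1987, 0.0000, 0.7155, 0.5248],
    [0.5052, 0.4271, 0.3983, 0.4559, 0.5957, 0.7155, 0.0000, 0.6370],
    [0.4100, 0.4117, 0.4450, 0.5794, 0.4811, 0.5248, 0.6370, 0.0000],
]


def make_find(parent):
    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i
    return find


def kruskal(D):
    """Return MST edges as (i, j, weight) with 1-based sample indices."""
    m = len(D)
    edges = sorted((D[i][j], i, j) for i in range(m) for j in range(i + 1, m))
    parent = list(range(m))
    find = make_find(parent)
    tree = []
    for weight, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            tree.append((i + 1, j + 1, weight))
    return tree


def clusters_at(m, tree, r):
    """Disconnect MST edges with weight greater than r; list the sub-trees."""
    parent = list(range(m + 1))       # index 0 is unused (1-based labels)
    find = make_find(parent)
    for i, j, weight in tree:
        if weight <= r:
            parent[find(i)] = find(j)
    groups = {}
    for i in range(1, m + 1):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


mst = kruskal(D)
print([(i, j) for i, j, _ in mst])
for _, _, r in sorted(mst, key=lambda e: -e[2]):
    print(r, clusters_at(len(D), mst, r))
```

With this matrix the tree consists of the edges (5, 6), (2, 3), (1, 3), (2, 4), (3, 7), (1, 5) and (1, 8), and cutting it at each edge weight in decreasing order gives the rows of Table 1.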



To compare the SVNMST clustering algorithm with the intuitionistic fuzzy MST clustering 
algorithm and the fuzzy MST clustering algorithm, we use the following example, which is discussed in 
[8, 9], for convenience of comparison. 

Example 2. To complete an operational mission, six sets of operational plans are made (adapted 
from [8, 9]). To group these operational plans with respect to their comprehensive function, a 
military committee has been set up to provide assessment information on them. The attributes which 
are considered here in the assessment of the six operational plans $A_j$ (j = 1, 2, ..., 6) are: (1) $x_1$ is the 
effectiveness of operational organization; and (2) $x_2$ is the effectiveness of operational command. 
The weight vector of the attributes $x_i$ (i = 1, 2) is $w = (0.45, 0.55)^T$. The military committee evaluates 
the performance of the six operational plans $A_j$ (j = 1, 2, ..., 6) with respect to the attributes $x_i$ (i = 1, 
2) and gives the SVNSs as follows: 

$A_1 = \{\langle x_1, 0.7, 0.2, 0.15\rangle, \langle x_2, 0.6, 0.3, 0.2\rangle\}$, 

$A_2 = \{\langle x_1, 0.4, 0.3, 0.35\rangle, \langle x_2, 0.8, 0.1, 0.1\rangle\}$, 

$A_3 = \{\langle x_1, 0.55, 0.2, 0.25\rangle, \langle x_2, 0.7, 0.1, 0.15\rangle\}$, 

$A_4 = \{\langle x_1, 0.44, 0.2, 0.35\rangle, \langle x_2, 0.6, 0.2, 0.2\rangle\}$, 

$A_5 = \{\langle x_1, 0.5, 0.15, 0.35\rangle, \langle x_2, 0.75, 0.1, 0.2\rangle\}$, 

$A_6 = \{\langle x_1, 0.55, 0.2, 0.25\rangle, \langle x_2, 0.57, 0.2, 0.15\rangle\}$. 

Then we employ the SVNMST clustering algorithm to group these operational plans $A_j$ (j = 1, 
2, ..., 6): 

Step 1. Calculate the distance $d_{ij} = d_\lambda(A_i, A_j)$ by Eq. (1) (take $\lambda = 2$). Then we can establish the 
single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$ as follows: 

$$D = \begin{pmatrix}
0.0000 & 0.2327 & 0.1507 & 0.1637 & 0.1940 & 0.1051 \\
0.2327 & 0.0000 & 0.1127 & 0.1326 & 0.1131 & 0.1546 \\
0.1507 & 0.1127 & 0.0000 & 0.1056 & 0.0764 & 0.0802 \\
0.1637 & 0.1326 & 0.1056 & 0.0000 & 0.0940 & 0.0784 \\
0.1940 & 0.1131 & 0.0764 & 0.0940 & 0.0000 & 0.1210 \\
0.1051 & 0.1546 & 0.0802 & 0.0784 & 0.1210 & 0.0000
\end{pmatrix}$$




Step 2. Draw the single valued neutrosophic graph G(V, E), where every edge between $A_i$ and $A_j$ 
(i, j = 1, 2, ..., 6) is assigned the weight (single valued neutrosophic distance) $d_{ij}$ coming from an 
element of the single valued neutrosophic distance matrix $D = (d_{ij})_{m \times m}$, which represents the 
dissimilarity degree between the samples $A_i$ and $A_j$. The single valued neutrosophic graph G(V, E) is 
shown in Fig. 6. 




Fig. 6 Single valued neutrosophic graph G(V, E) with the six nodes 

Step 3. Build the MST of the single valued neutrosophic graph G(V, E) by Kruskal's method 
[14] or Prim's method [15]: 

(1) Sort the edges of G in increasing order by weights: 

$d_{35} < d_{46} < d_{36} < d_{45} < d_{16} < d_{34} < d_{23} < d_{25} < d_{56} < d_{24} < d_{13} < d_{26} < d_{14} < d_{15} < d_{12}$. 

(2) Keep an empty sub-graph S of G and add the edge e with the smallest weight to S, where 
the endpoints of e are disconnected; thus we choose the edge $e_{35}$ between $A_3$ and $A_5$. 

(3) Repeat the process (2) until the sub-graph S spans six nodes. Thus, the MST of the single 
valued neutrosophic graph G(V, E) is obtained, as shown in Fig. 7. 

Fig. 7 The MST of the single valued neutrosophic graph G(V, E) with the six nodes 

Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than 
r to obtain a certain number of sub-trees (clusters), as listed in Table 2. 



Table 2. SVNMST clustering results of the six operational plans 

r                          Corresponding clustering result 
$r = d_{23} = 0.1127$      $\{A_1, A_2, A_3, A_4, A_5, A_6\}$ 
$r = d_{16} = 0.1051$      $\{A_2\}, \{A_1, A_3, A_4, A_5, A_6\}$ 
$r = d_{36} = 0.0802$      $\{A_1\}, \{A_2\}, \{A_3, A_4, A_5, A_6\}$ 
$r = d_{46} = 0.0784$      $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4, A_6\}$ 
$r = d_{35} = 0.0764$      $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4\}, \{A_6\}$ 
$r = 0$                    $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$ 



To compare the SVNMST clustering algorithm with the IFMST clustering algorithm in [8], 
assume that the indeterminacy-membership function $I_{A_j}(x_i)$ is not considered independently in an 
SVNS $A_j$; then the samples' information given by the military committee becomes the intuitionistic 
fuzzy data (adopted in [8]): 

$A_1 = \{\langle x_1, 0.7, 0.15\rangle, \langle x_2, 0.6, 0.2\rangle\}$, $A_2 = \{\langle x_1, 0.4, 0.35\rangle, \langle x_2, 0.8, 0.1\rangle\}$, 

$A_3 = \{\langle x_1, 0.55, 0.25\rangle, \langle x_2, 0.7, 0.15\rangle\}$, $A_4 = \{\langle x_1, 0.44, 0.35\rangle, \langle x_2, 0.6, 0.2\rangle\}$, 

$A_5 = \{\langle x_1, 0.5, 0.35\rangle, \langle x_2, 0.75, 0.2\rangle\}$, $A_6 = \{\langle x_1, 0.55, 0.25\rangle, \langle x_2, 0.57, 0.15\rangle\}$. 

Then, the operational plans $A_j$ (j = 1, 2, ..., 6) can be clustered by the following IFMST 
clustering algorithm [8]: 






Step 1. Calculate $d_{ij} = d(A_i, A_j)$ by the intuitionistic fuzzy distance measure: 

$$d(A, B) = \frac{1}{2} \sum_{i=1}^{n} w_i \left( |\mu_A(x_i) - \mu_B(x_i)| + |\nu_A(x_i) - \nu_B(x_i)| + |\pi_A(x_i) - \pi_B(x_i)| \right). \tag{4}$$

Thus, we get the intuitionistic fuzzy distance matrix: 

$$D = \begin{pmatrix}
0.0000 & 0.2450 & 0.1225 & 0.1170 & 0.1725 & 0.1115 \\
0.2450 & 0.0000 & 0.1225 & 0.1280 & 0.1000 & 0.1940 \\
0.1225 & 0.1225 & 0.0000 & 0.1045 & 0.1000 & 0.0715 \\
0.1170 & 0.1280 & 0.1045 & 0.0000 & 0.1095 & 0.0935 \\
0.1725 & 0.1000 & 0.1000 & 0.1095 & 0.0000 & 0.1715 \\
0.1115 & 0.1940 & 0.0715 & 0.0935 & 0.1715 & 0.0000
\end{pmatrix}$$

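
The intuitionistic fuzzy distance matrix can be checked against the data. The sketch below assumes the weighted Hamming form of the intuitionistic fuzzy distance with the factor 1/2, with the hesitancy degree taken as $\pi = 1 - \mu - \nu$ (the usual convention, not stated explicitly in the text); under these assumptions it agrees with the printed matrix to four decimals. Function and variable names are hypothetical.

```python
W = (0.45, 0.55)

# (mu, nu) pairs for x1 and x2, copied from the intuitionistic fuzzy data.
IFS = [
    [(0.70, 0.15), (0.60, 0.20)],
    [(0.40, 0.35), (0.80, 0.10)],
    [(0.55, 0.25), (0.70, 0.15)],
    [(0.44, 0.35), (0.60, 0.20)],
    [(0.50, 0.35), (0.75, 0.20)],
    [(0.55, 0.25), (0.57, 0.15)],
]

# Printed intuitionistic fuzzy distance matrix.
PRINTED = [
    [0.0000, 0.2450, 0.1225, 0.1170, 0.1725, 0.1115],
    [0.2450, 0.0000, 0.1225, 0.1280, 0.1000, 0.1940],
    [0.1225, 0.1225, 0.0000, 0.1045, 0.1000, 0.0715],
    [0.1170, 0.1280, 0.1045, 0.0000, 0.1095, 0.0935],
    [0.1725, 0.1000, 0.1000, 0.1095, 0.0000, 0.1715],
    [0.1115, 0.1940, 0.0715, 0.0935, 0.1715, 0.0000],
]


def if_hamming(a, b, w):
    """Weighted Hamming distance over (mu, nu, pi), halved."""
    total = 0.0
    for wi, (mu_a, nu_a), (mu_b, nu_b) in zip(w, a, b):
        pi_a = 1.0 - mu_a - nu_a   # hesitancy degree (assumed convention)
        pi_b = 1.0 - mu_b - nu_b
        total += wi * (abs(mu_a - mu_b) + abs(nu_a - nu_b) + abs(pi_a - pi_b))
    return total / 2.0


computed = [[if_hamming(a, b, W) for b in IFS] for a in IFS]
max_gap = max(abs(computed[i][j] - PRINTED[i][j])
              for i in range(6) for j in range(6))
print(f"largest deviation from the printed matrix: {max_gap:.1e}")
```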



Step 2. Construct the intuitionistic fuzzy graph G = (V, E); see Step 2 in the SVNMST 
clustering algorithm and Fig. 6. 

Step 3. Get the MST of the intuitionistic fuzzy graph G = (V, E) by Kruskal's method [14] or 
Prim's method [15]: 

(1) Sort the edges of G in increasing order by weights: 

$d_{36} < d_{46} < d_{35} = d_{25} < d_{34} < d_{45} < d_{16} < d_{14} < d_{13} = d_{23} < d_{24} < d_{56} < d_{15} < d_{26} < d_{12}$. 

(2) Keep an empty sub-graph S of G and add the edge e with the smallest weight to S, where 
the endpoints of e are disconnected; thus we choose the edge $e_{36}$ between $A_3$ and $A_6$. 

(3) Repeat the process (2) until the sub-graph S spans six nodes. Thus, the MST of the 
intuitionistic fuzzy graph G(V, E) is obtained, as shown in Fig. 8. 

Fig. 8 The MST of the intuitionistic fuzzy graph G = (V, E) 

Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than 
r to obtain a certain number of sub-trees (clusters), as listed in Table 3. 

Table 3. IFMST clustering results of the six operational plans 

r                               Corresponding clustering result 
$r = d_{16} = 0.1115$           $\{A_1, A_2, A_3, A_4, A_5, A_6\}$ 
$r = d_{25} = d_{35} = 0.1$     $\{A_1\}, \{A_2, A_3, A_4, A_5, A_6\}$ 
$r = d_{46} = 0.088$            $\{A_1\}, \{A_2\}, \{A_5\}, \{A_3, A_4, A_6\}$ 
$r = d_{36} = 0.0715$           $\{A_1\}, \{A_2\}, \{A_4\}, \{A_5\}, \{A_3, A_6\}$ 
$r = 0$                         $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$ 

It is well known that the fuzzy set is only composed of the membership degree. Therefore, we 
only consider the membership degrees of the intuitionistic fuzzy data, and then the operational plan 
information is given by the following fuzzy data [8]: 



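$A_1 = \{\langle x_1, 0.7\rangle, \langle x_2, 0.6\rangle\}$, $A_2 = \{\langle x_1, 0.4\rangle, \langle x_2, 0.8\rangle\}$, $A_3 = \{\langle x_1, 0.55\rangle, \langle x_2, 0.7\rangle\}$, 

$A_4 = \{\langle x_1, 0.44\rangle, \langle x_2, 0.6\rangle\}$, $A_5 = \{\langle x_1, 0.5\rangle, \langle x_2, 0.75\rangle\}$, $A_6 = \{\langle x_1, 0.55\rangle, \langle x_2, 0.57\rangle\}$. 

Then, the operational plans $A_j$ (j = 1, 2, ..., 6) can be clustered by the fuzzy MST clustering 
algorithm: 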

Step 1. Calculate $d_{ij} = d(A_i, A_j)$ by the fuzzy distance measure: 

$$d(A, B) = \left[ \sum_{i=1}^{n} w_i \, |\mu_A(x_i) - \mu_B(x_i)|^{2} \right]^{1/2}. \tag{5}$$

Thus, we obtain the fuzzy distance matrix: 

$$D = \begin{pmatrix}
0.0000 & 0.2500 & 0.1250 & 0.1744 & 0.1743 & 0.1031 \\
0.2500 & 0.0000 & 0.1250 & 0.1507 & 0.0766 & 0.1980 \\
0.1250 & 0.1250 & 0.0000 & 0.1046 & 0.0500 & 0.0964 \\
0.1744 & 0.1507 & 0.1046 & 0.0000 & 0.1183 & 0.0771 \\
0.1743 & 0.0766 & 0.0500 & 0.1183 & 0.0000 & 0.1376 \\
0.1031 & 0.1980 & 0.0964 & 0.0771 & 0.1376 & 0.0000
\end{pmatrix}$$

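
The fuzzy distance matrix can be recomputed in the same way. Assuming the weighted Euclidean form $d(A, B) = [\sum_i w_i |\mu_A(x_i) - \mu_B(x_i)|^2]^{1/2}$ with no leading fraction, the sketch below agrees with the printed matrix up to rounding in the fourth decimal. The names used are hypothetical.

```python
import math

W = (0.45, 0.55)

# Membership degrees for (x1, x2), copied from the fuzzy data.
MU = [(0.70, 0.60), (0.40, 0.80), (0.55, 0.70),
      (0.44, 0.60), (0.50, 0.75), (0.55, 0.57)]

# Printed fuzzy distance matrix.
PRINTED = [
    [0.0000, 0.2500, 0.1250, 0.1744, 0.1743, 0.1031],
    [0.2500, 0.0000, 0.1250, 0.1507, 0.0766, 0.1980],
    [0.1250, 0.1250, 0.0000, 0.1046, 0.0500, 0.0964],
    [0.1744, 0.1507, 0.1046, 0.0000, 0.1183, 0.0771],
    [0.1743, 0.0766, 0.0500, 0.1183, 0.0000, 0.1376],
    [0.1031, 0.1980, 0.0964, 0.0771, 0.1376, 0.0000],
]


def fuzzy_distance(a, b, w):
    """Weighted Euclidean distance between two membership vectors."""
    return math.sqrt(sum(wi * (p - q) ** 2 for wi, p, q in zip(w, a, b)))


computed = [[fuzzy_distance(a, b, W) for b in MU] for a in MU]
max_gap = max(abs(computed[i][j] - PRINTED[i][j])
              for i in range(6) for j in range(6))
print(f"largest deviation from the printed matrix: {max_gap:.1e}")
```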



Step 2. Draw the fuzzy graph G = (V, E). See also Step 2 in the SVNMST clustering algorithm 
and Fig. 6. 

Step 3. Build the MST of the fuzzy graph G = (V, E) by Kruskal's method [14] or Prim's 
method [15]: 

(1) Sort the edges of G in increasing order by weights: 

$d_{35} < d_{25} < d_{46} < d_{36} < d_{16} < d_{34} < d_{45} < d_{13} = d_{23} < d_{56} < d_{24} < d_{15} < d_{14} < d_{26} < d_{12}$. 

(2) Keep an empty sub-graph S of G and add the edge e with the smallest weight to S, where 
the endpoints of e are disconnected; thus we choose the edge $e_{35}$ between $A_3$ and $A_5$. 

(3) Repeat the process (2) until the sub-graph S spans six nodes. Thus, the MST of the fuzzy 
graph G(V, E) is obtained, as shown in Fig. 9. 

Fig. 9 The MST of the fuzzy graph G = (V, E) 

Step 4. Select a threshold r and disconnect all the edges of the MST with weights greater than 
r to obtain a certain number of sub-trees (clusters), as listed in Table 4. 



Table 4. Fuzzy MST clustering results 

r                          Corresponding clustering result 
$r = d_{16} = 0.1031$      $\{A_1, A_2, A_3, A_4, A_5, A_6\}$ 
$r = d_{36} = 0.0964$      $\{A_1\}, \{A_2, A_3, A_4, A_5, A_6\}$ 
$r = d_{46} = 0.0774$      $\{A_1\}, \{A_2, A_3, A_5\}, \{A_4, A_6\}$ 
$r = d_{25} = 0.0766$      $\{A_1\}, \{A_2, A_3, A_5\}, \{A_4\}, \{A_6\}$ 
$r = d_{35} = 0.0500$      $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4\}, \{A_6\}$ 
$r = 0$                    $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$ 



For ease of comparison, the clustering results of the three clustering algorithms are collected 
in Table 5. 



Table 5. Clustering results of three kinds of clustering algorithms 

Class   SVNMST clustering algorithm   IFMST clustering algorithm   Fuzzy MST clustering algorithm 
1       $\{A_1, A_2, A_3, A_4, A_5, A_6\}$   $\{A_1, A_2, A_3, A_4, A_5, A_6\}$   $\{A_1, A_2, A_3, A_4, A_5, A_6\}$ 
2       $\{A_2\}, \{A_1, A_3, A_4, A_5, A_6\}$   $\{A_1\}, \{A_2, A_3, A_4, A_5, A_6\}$   $\{A_1\}, \{A_2, A_3, A_4, A_5, A_6\}$ 
3       $\{A_1\}, \{A_2\}, \{A_3, A_4, A_5, A_6\}$   -   $\{A_1\}, \{A_2, A_3, A_5\}, \{A_4, A_6\}$ 
4       $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4, A_6\}$   $\{A_1\}, \{A_2\}, \{A_5\}, \{A_3, A_4, A_6\}$   $\{A_1\}, \{A_2, A_3, A_5\}, \{A_4\}, \{A_6\}$ 
5       $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4\}, \{A_6\}$   $\{A_1\}, \{A_2\}, \{A_4\}, \{A_5\}, \{A_3, A_6\}$   $\{A_1\}, \{A_2\}, \{A_3, A_5\}, \{A_4\}, \{A_6\}$ 
6       $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$   $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$   $\{A_1\}, \{A_2\}, \{A_3\}, \{A_4\}, \{A_5\}, \{A_6\}$ 



From Table 5, we can see that the clustering results of the three clustering algorithms are quite 
different. The main reason can be given by the following comparative analysis. 

It is well known that single valued neutrosophic information is a generalization of 
intuitionistic fuzzy information, and intuitionistic fuzzy information is a further generalization of 
fuzzy information. On the one hand, an SVNS is an instance of a neutrosophic set, which gives us an 
additional possibility to represent uncertain, imprecise, incomplete, and inconsistent information 
which exists in the real world. It can describe and handle indeterminate information and inconsistent 
information. However, the connector in the fuzzy set is defined with respect to T, i.e. membership 
only, hence the information of indeterminacy and nonmembership is lost. The connectors in the 
intuitionistic fuzzy set are defined with respect to T and F, i.e. membership and nonmembership only, 
hence the indeterminacy is what is left from 1, and the IFS can only handle incomplete 
information but not indeterminate information and inconsistent information. In the SVNS, by contrast, 
truth-membership, indeterminacy-membership, and falsity-membership are represented 
independently, and the connectors can be defined with respect to any of them (no restriction). So the 
notion of SVNSs is more general. On the other hand, clustering analysis under a single valued 
neutrosophic environment is suitable for capturing imprecise, uncertain, and inconsistent information 
in clustering the data. Thus, the SVNMST clustering algorithm clusters single valued 
neutrosophic information, while the IFMST clustering algorithm clusters intuitionistic fuzzy 
information and the fuzzy MST clustering algorithm clusters fuzzy information. Obviously, the 
SVNMST clustering algorithm is an extension of both the IFMST clustering algorithm and the 
fuzzy MST clustering algorithm. Therefore, compared with the IFMST clustering algorithm and the 
fuzzy MST clustering algorithm, the SVNMST clustering algorithm is more general. Furthermore, 
in situations which are represented by indeterminate information and 
inconsistent information, the SVNMST clustering algorithm demonstrates its superiority in 
clustering those single valued neutrosophic data. 

6. Conclusion 

This paper defined a generalized single valued neutrosophic weighted distance and proposed 
the SVNMST clustering algorithm as a generalization of the IFMST clustering algorithm. Through 
the computational tests on the SVNMST clustering algorithm, the IFMST clustering algorithm and 
the fuzzy MST clustering algorithm, the clustering results have shown that the SVNMST clustering 
algorithm is more general and more reasonable than the IFMST clustering algorithm and the fuzzy 
MST clustering algorithm. Furthermore, in situations which are represented by indeterminate 
information and inconsistent information, the SVNMST clustering algorithm demonstrates its 
superiority in clustering those single valued neutrosophic data, since SVNSs are a powerful tool 
to deal with uncertain, imprecise, incomplete, and inconsistent information. In the future, the 
developed algorithm can be applied to many areas such as information retrieval, investment decision 
making, and data mining. 